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A theory of systems with long-range correlations based on the consideration of binary N-step 
Markov chains is developed. In the model, the conditional probability that the i-th symbol in the 
chain equals zero (or unity) is a linear function of the number of unities among the preceding A'' 
symbols. The correlation and distribution functions as well as the variance of number of symbols 
in the words of arbitrary length L are obtained analytically and numerically. A self-similarity 
of the studied stochastic process is revealed and the similarity group transformation of the chain 
parameters is presented. The diffusion Fokker-Planck equation governing the distribution function 
of the L-words is explored. If the persistent correlations are not extremely strong, the distribution 
function is shown to be the Gaussian with the variance being nonlinearly dependent on L. The 
applicability of the developed theory to the coarse-grained written and DNA texts is discussed. 
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I. INTRODUCTION 

The problem of systems with long-range spatial and/or 
temporal correlations (LRCS) is one of the topics of in- 
tensive research in modern physics, as well as in the 
theory of dynamical systems and the theory of proba- 
bility. The LRC-systems are usually characterized by a 
complex structure and contain a number of hierarchic 
objects as their subsystems. The LRC-systems are the 
subject of study in physics, biology, economics, linguis- 
tics, sociology, geography, psychology, etc. P, S S 0- 
At the present time, there is no generally accepted the- 
oretical model that adequately describes the dynamical 
and statistical properties of the LRC-systems. Attempts 
to describe the behavior of the LRCS in the framework 
of the Tsalis non-extensive thermodynamics ^ were 
undertaken in Ref. However, the non-extensive ther- 
modynamics is not well-grounded and requires the con- 
struction of the additional models which could clarify the 
properties of the LRC-systems. 

One of the efficient methods to investigate the corre- 
lated systems is based on a decomposition of the space 
of states into a finite number of parts labelled by definite 
symbols. This procedure referred to as coarse graining is 
accompanied by the loss of short-range memory between 
states of system but does not affect and does not damage 
its robust invariant statistical properties on large scales. 
The most frequently used method of the decomposition 
is based on the introduction of two parts of the phase 
space. In other words, it consists in mapping the two 
parts of states onto two symbols, say and 1. Thus, the 
problem is reduced to investigating the statistical prop- 



erties of the symbolic binary sequences. This method is 
applicable for the examination of both discrete and con- 
tinuous systems. 

One of the ways to get a correct insight into the na- 
ture of correlations consists in an ability of constructing a 
mathematical object (for example, a correlated sequence 
of symbols) possessing the same statistical properties as 
the initial system. There are many algorithms to gener- 
ate long-range correlated sequences: the inverse Fourjer 
transform j^], the expansion-modification Li method 



di, 



the Voss procedure of consequent random addition 
the correlated Levy walks ^J, etc. @. We believe that, 
among the above-mentioned methods, using the Markov 
chains is one of the most important. We would like to 
demonstrate this statement in the present paper. 

In the following sections, the statistical properties of 
the binary many-steps Markov chain is examined. In 
spite of the long-time history of studying the Markov 
sequences (see, for example, 0, 0| and references 
therein) , the concrete expressions for the variance of sums 
of random variables in such strings have not yet been 
obtained. Our model operates with two parameters gov- 
erning the conditional probability of the discrete Markov 
process, specifically with the memory length N and the 
correlation parameter fi. The correlation and distribu- 
tion functions as well as the variance D being nonlin- 
early dependent on the length L of a word are derived 
analytically and calculated numerically. The nonlinear- 
ity of the D{L) function reflects the existence of strong 
correlations in the system. The evolved theory is applied 
to the coarse-grained written texts and dictionaries, and 
to DNA strings as well. 
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II. FORMULATION OF THE PROBLEM 

A. Markov Processes 

Let us consider a homogeneous binary sequence of sym- 
bols, fli = {0,1}. To determine the N-step Markov 
chain we have to introduce the conditional probabil- 
ity P{ai I tti-N, (li-N+i, • ■ • , fli-i) of occurring the def- 
inite symbol a; (for example, = 0) after symbols 
ai-NjCii-N+i, ■ ■ ■ jdi-i- Thus, it is necessary to define 
2^ values of the P-function corresponding to each possi- 
ble configuration of the symbols Qi-n, ai^N^i, . . . , ai_i. 
We suppose that the P-function has the form, 



P{a, = I ai_Ar, a,^N+i, . 



1 ^ 



(1) 



k=l 



Such a relation corresponds to the additive influence of 
the previous symbols on the generated one. The homo- 
geneity of the Markov chain is provided by the indepen- 
dence of the conditional probability Eq. of the index 
i. 

It is reasonable to assume the function / to be de- 
creasing with an increase of the distance k between the 
symbols ai^k and in the Markov chain. However, for 
the sake of simplicity we consider here a step-like memory 
function f(ai-k,k) independent of the second argument 
k. As a result, the model is characterized by three pa- 
rameters only, specifically by /(O), /(I), and N: 



P{ai — I ai_jv, fli-jv+i, 



i) 



1 ^ 



(2) 



fc=i 



Note that the probability P in Eq. (0) depends on the 
numbers of symbols and 1 in the A^-word but is inde- 
pendent of the arrangement of the elements aj_fc. We 
also suppose that 



one can see the symmetry with respect to interchange 
di Gi in the Markov chain. Here di is the symbol 
opposite to Ui, di = 1 — ai. Therefore, the probabilities 
of occurring the words (oi, . . . ,0^) and (ai, . . . ,(2^) are 
equal to each other for any word length L. At L = 1 this 
yields equal average probabilities that symbols and 1 
occur in the chain. 

Taking into account the symmetry of the conditional 
probability P with respect to a permutation of symbols ai 
(see Eq. Q), we can simplify the notations and introduce 
the conditional probability pk of occurring the symbol 
zero after the A'^-word containing k unities, e.g., after the 
word (11. ..1 00^), 

N-k 

Pk = P(aAr+l = I 1^. (W^) 

N-k 



p{l 



2k 

w 



(5) 



with the correlation parameter /i being defined by the 
relation 



M = /(0)-^. 



(6) 



/(0) + /(l) = l. 



(3) 



We focus our attention on the region of p determined 
by the persistence inequality < n < 1/2. In this case, 
each of the symbols unity in the preceding N-word pro- 
motes the birth of new symbol unity. Nevertheless, the 
major part of our results is valid for the anti-persistent 
region — l/2</i<0as well. 

A similar rule for the production of an TV-word 
(oi, . . . , fljv) that follows after a word (ag, ai, . . . , a^-i) 
was suggested in Ref. However, the conditional prob- 
ability Pk of occurring the symbols ojv does not depend 
on the previous ones in the model |^ . 



B. Statistical characteristics of the chain 

In order to investigate the statistical properties of the 
Markov chain, we consider the distribution WL(fc) of the 
words of definite length L by the number k of unities in 
them. 



This relation provides the statistical equality of the num- 
bers of symbols zero and unity in the Markov chain under 
consideration. In other words, the chain is non-biased. 
Indeed, taking into account Eqs. and © and the se- 
quence of equations. 



lla. 



,) = 1-P{a,, = 0\a, 



l) 



hiL) ^ Qi+i, 



1=1 



and the variance of k, 



D{L) = k^-k , 



where 



(7) 



(8) 



1 ^ 

fc=i 



-N) 



P{a, - 



(4) 



/(fc)-^/(fc)W^i(fc). 

fe=0 



(9) 
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If /U = 0, one arrives at the known result for the non- 
correlated Brownian diffusion, 



D{L) = L/4. 



(10) 



We will show that the distribution function Wi(fe) for 
the sequence determined by Eq. (O (with nonzero but not 
extremely close to 1/2 parameter /i) is the Gaussian with 
the variance D(L) nonlinearly dependent on L. However, 
at fJ. 1/2 the distribution function can differ from the 
Gaussian. 



C. Main equation 

For the stationary Markov chain, the probability 
b{aia2 ■ ■ ■ ajv) of occurring a certain word (ai, a2, . . . , ajv) 
satisfies the condition of compatibility for the Chapmen- 
Kolmogorov equation (see, for example, Ref. [ISj): 



(0-1 



■ aN) 



= X! ^(aai ■ • ■aJv-i)^'(aAr I a, ai, . . . ,ajv-i)- (H) 

a=0,l 

Thus, we have 2^ homogeneous algebraic equations for 
the 2^ probabilities h of occurring the A^-words and the 
normalization equation ^6=1. In the case under con- 
sideration, the set of equations can be substantially sim- 
plified owing to the following statement. 

Proposition 4tt: The probability 6(aia2 . . . oat) de- 
pends on the number k of unities in the N-word only, 
i. e., it is independent of the arrangement of symbols in 
the word (ai, a2, . . . , a^v)- 



-o 
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FIG. 1: The probability b of occurring a word (ai, a2, . • . , ajv) 
vs its number z expressed in the binary code, z = X/i=i ' 
2*-\ for AT = 8, ^ = 0.4. 

This statement illustrated by Fig. 1 is valid owing to 
the chosen simple model 0, © of the Markov chain. 



It can be easily verified directly by substituting the ob- 
tained below solution (|15|l into the set Hll|) . Note that 
according to the Markov theorem, Eqs. (|ll|l do not have 
other solutions T^. 

Proposition ^ leads to the very important property 
of isotropy: any word (ai, 02, . . . , ol) appears with the 
same probability as the inverted one, (a^, a^^i, . . . , ai). 

Let us apply the set of Eqs. l(TT|l to the word 
(n_^ 00^): 

k N-k 

6(11 ... 1 00 ... 0) = 6(0 11 ... 1 00 . . . 0)pk 



N-k 



N-k-1 



(12) 



+ 6(in^ oo^)Pfc+i. 

k N-k-1 

This yields the recursion relation for b{k) — 
b{ll^ 00^), 

k N-k 



Pk 



-b{k-i) 



N ~2fi{N ~2k + 2) 
N + 2^{N - 2k) 



b{k-l). 



(13) 



The probabilities b{k) for /z > satisfy the sequence of 
inequalities, 

6(0) = b{N) > 6(1) = b{N - 1) > ... > 6(Ar/2), (14) 

which is the reflection of persistent properties for the 
chain. At n = all probabilities are equal to each other. 
The solution of Eq. I^ll is 

b{k)^A-r{n + k)T{n + N~k) (15) 

with the parameter n defined by 

N{l-2n) 



4^i 



(16) 



The constant A will be found below by normalizing the 
distribution function. Its value is. 



A 



r(l/2 + n) 



2V^F(n)F(2n-f TV)' 



(17) 



III. DISTRIBUTION FUNCTION OF L- WORDS 

In this section we investigate the statistical properties 
of the Markov chain, specifically, the distribution of the 
words of definite length L by the number k of unities. 
The length L can also be interpreted as the number of 
jumps of some particle over an integer-valued 1-D lattice 
or as the time of the diffusion imposed by the Markov 
chain under consideration. The form of the distribution 
function WL{k) depends, to a large extent, on the relation 
between the word length L and the memory length N. 
Therefore, the first thing we will do is to examine the 
simplest case L = N. 
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Statistics of A'^-words 



2. Intermediate case, n > 1 



The value h{k) is the probabihty that an A^-word con- 
tains k unities with a definite order of symbols . There- 
fore, the probability Wiq{k) that an A^-word contains k 
unities with arbitrary order of symbols ai is b{k) mul- 
tiphed by the number C% — N\/k\{N — k)l of different 
permutations of k unities in the A^-word, 

WN{k) = C%b{k). (18) 

Combining Eqs. 1)15(1 and ((18(1 . we find the distribution 
function, 

1 [n)l [n + N) 

The normalization constant Wjv(O) can be obtained from 

N 

the equality J2 Wjyik) = 1, 

k=0 



r(n + iV)r(l/2 + n) 



(20) 



20F T{2n + N) 

Comparing Eqs. (fT^ . (fH^ - lf^ . one can get Eq. l(T7|) for 
the constant A in Eq. p5(l . 

Note that the distribution T40v(fc) is an even function 
of the variable k — k — N/2, 

WNiN - k) ^WNik). (21) 

This fact is a direct consequence of the above-mentioned 
statistical equivalence of zeros and unities in the Markov 
chain being considered. Let us analyze the distribution 
function VFjv(fc) for different relations between the pa- 
rameters N and fj,. 



1. Limiting case of weak persistence, n S> 1 

In the absence of correlations, n oo, Eq. 1(19(1 and 
the Stirling formula yield the Gaussian distribution at 
k, N, N — k:^ 1. Given the persistence is not too strong, 

n>l, (22) 

one can also obtain the Gaussian form for the distribution 
function. 



WN{k) 



1 



: exp 



{k^N/2y 
2D{N) 



(23) 



N 



(24) 



v/27ri?(iV) 
with the /i-dependent variance, 

^ ' 8n 4(1 -2Ai) 

Equation l(23() says that the iV- words with equal numbers 
of zeros and unities, k = N/2, are most probable. Note 
that the persistence results in an increase of the variance 
D{N) with respect to its value 7V/4 at /i = 0. In other 
words, the persistence is conductive to the intensification 
of the diffusion. Inequality n ^ 1 gives D{N) <C iV^. 
Therefore, despite the increase oi D{N), the fluctuations 
of (fc — N/2) of the order of TV are exponentially small. 



If the parameter n is an integer of the order of unity, 
the distribution function Wn (k) is a polynomial of degree 
2{n — 1). In particular, at n = 1, the function WN(k) is 
constant, 



WN{k) 



1 



N+1 



(25) 



At ri 7^ 1, T40v(fc) has a maximum in the middle of the 
interval [0,N]. 



3. Limiting case of strong persistence 



If the parameter n satisfies the inequality. 



n < In ^ iV, 



(26) 



one can neglect the parameter n in the arguments of the 
functions T{n+k), T{n+N), and r{n+N-k) in Eq. 
In this case, the distribution function Wi^ik) assumes its 
maximal values at fc = and k = N, 

nN 

WnII) = Wn{0)j^ « WnIO). (27) 

Formula (|?7jl describes the sharply decreasing WAr(fc) as 
k varies from to 1 (and from N to N — 1). Then, at 
1 < k < N/2, the function W^ik) decreases more slowly 
with an increase in fc, 



WNik) = VFw(O) 



nN 



k{N -k) 



(28) 



At fc = N /2, the probability WN{k) achieves its minimal 
value. 



' N\ An 



(29) 



It follows from normalization l(2U(l that the values 
Wm{Q) — Wm{N) are approximatively equal to 1/2. Ne- 
glecting the terms of the order of n^ , one gets 

WN{Q) = \{l-n\nN). (30) 

In the straightforward calculation using Eqs. © and 
((28(1 the variance D is 

, , N^ nN(N - 1) 
DiN) ^- '-. (31) 

Thus, the variance D{N) is equal to A^^/2 in the lead- 
ing approximation in the parameter n. This fact has a 
simple explanation. The probability of occurrence the 
iV-word containing N unities is approximatively equal 

to 1/2. So, the relations F « N^/2 and fc^ = N^/i 
give ((31(1 . The case of strong persistence corresponds 
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to the so-called ballistic regime of diffusion: if we chose 
randomly some symbol in the sequence, it will be sur- 
rounded by the same symbols with the probability close 
to unity. 

The evolution of the distribution function VFjv(fc) from 
the Gaussian form to the inverse one with a decrease 
of the parameter n is shown in Fig. 2. In the interval 
ln~^ iV < n < 1 the curve WOv(fc) is concave and the 
maximum of function WN{k) inverts into minimum. At 
TV » 1 and In"^ iV < n < 1, the curve remains a smooth 
function of its argument k as shown by curve with n — 
0.5 in Fig. 2. Below, we will not consider this relatively 
narrow region of the change in the parameter n. 

Formulas (HU and (El describe the 

statistical properties of i-words for the fixed "diffusion 
time" L = N. It is necessary to examine the distribution 
fimction WL{k) for the general situation, L ^ N. We 
start the analysis with L < N. 



0,15 



0,10- 



0,05- 



0,00 



FIG. 2: The distribution function WNik) for iV=20 and dif- 
ferent values of the parameter n shown near the curves. 



B. Statistics of i-words with L < N 




The remaining {i — k) unities are situated within the left- 
hand part of the word (within {N — L)-sub-word). The 
multiplier C|_C^_*^^ in Eq. I|32|) takes into account all 
possible permutations of the symbols " 1" within the A^- 
word on condition that the L-sub-word always contains k 
unities. Then we perform the summation over all possible 
values of the number i. Note that Eq. H32|l is a direct 
consequence of the proposition 4|k formulated in Subsec. C 
of the previous section. 

The straightforward summation in Eq. H32|l yields the 
following formula that is valid at any value of the param- 
eter n: 



WL{k)^WL{0)Cl^ 



k)Tin + L-k) 



where 



Wl{0) = 



T{n)T{n + L) 



4" T{l/2 + n)T{n + L) 
F(2n + L) ' 



(34) 



(35) 



It is of interest to note that the parameter of per- 
sistence /i and the memory length N are presented in 
Eqs. H34(l . H35|l via the parameter n only. This means 
that the statistical properties of the L- words with L < N 
are defined by this single "combined" parameter. 

In the limiting case of weak persistence, n ^ 1, at 
k, L — k ^ 1, Eq. H34|l along with the Stirling formula 
give the Gaussian distribution function, 



WL{k) 



1 



with the variance D{L), 
D{L) 



exp 







L 










2n J 


^ 4 



{k-L/2f 
2D{L) 



2fiL 



N{1 ~ 2n) 



(36) 



(37) 



In the case of strong persistence (|26|) . the asymptotic 
expression for the distribution function Eq. can be 
written as 



nL 



k^O, k^L, 



(38) 



1. Distribution function WL{k) 

The distribution function WL^k) &i L < N can be 
given as 

k+N-L 

WL{k)^ J2 mclC^N^. (32) 

i—k 

This equation follows from the consideration of A^- words 
consisting of two parts, 

(ai, . . . , ttN-L, aN-L+i, ■ ■ ■ , (In)- (33) 

V ' " v ' 

i~k unities k unities 

The total number of unities in this word is i. The right- 
hand part of the word (L-sub-word) contains k unities. 



Wl{0) = Wl{L)^ -{l-n\nL). (39) 

Both the distribution VFi,(fc) H38|l and the function 
WN{k) (|28|l has a concave form. The former assumes 
the maximal value (|39|l at the edges of the interval [0, L] 
and has a minimum at fc = L/2. 

2. Variance D{L) 

Using the definition Eq. ^ and the distribution func- 
tion Eq. one can obtain a very simple formula for 
the variance D{L), 

D{L)^j[l + m{L-l)], (40) 
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with 



1 



2^1 



l + 2n N~2^l{N-l)' 



(41) 



Eq. (|40|l shows that the variance D{L) obeys the para- 
bohc law independently of the correlation strength in the 
Markov chain. 

In the case of weak persistence, at n 3> 1, we obtain 
the asymptotics Eq. 1)3 7|) . It allows one to analyze the 
behavior of the variance D{L) with an increase in the 
"diffusion time" L. At small mL <C 1, the dependence 
D{L) follows the classical law of the Brownian diffusion, 
D{L) « L/4. Then, at toL 1, the function D{L) 
becomes super-linear. 

For the case of strong persistence, n <C 1, Eq. H4U|) 
gives the asymptotics. 



DiL) 



nL{L - 1) 
1 2 ' 



(42) 



The ballistic regime of diffusion leads to the quadratic 
law of the D{L) dependence in the zero approximation 
in the parameter n ^ 1. 

The unusual behavior of the variance D{L) raises an 
issue as to what particular type of the diffusion equa- 
tion corresponds to the nonlinear dependence D{L) in 
Eq. (|37|l . In the following subsection, when solving this 
problem, we will obtain the conditional probability p*^"' 
of occurring the symbol zero after a given i-word with 
L < N. The ability to find with some reduced in- 
formation about the preceding symbols being available, 
is very important for the study of the self-similarity of 
the Markov chain (see Subsubsec. 4 of this Subsection). 



3. Generalized diffusion equation at L < N , 1 

It is quite obvious that the distribution WL{k) satisfies 
the equation 

WL+i{k) - WL{k)p'^°Kk) + Wdk - l)p^^\k - 1). (43) 

Here p'^^\k) is the probability of occurring "0" after 
an average-statistical L-word containing k unities and 
p^^\k — 1) is the probability of occurring "1" after an 
L-word containing (fc — 1) unities. At L < N, the prob- 
ability p^^\k) can be written as 



p(o)(fc) 



WL{k) 



k+N-L 

■ E 

i—k 



pM^)GiC^N^:^. 



(44) 



The product 6(z)C|C]y^^ in this formula represents the 
conditional probability of occurring the iV-word contain- 
ing i unities, the right-hand part of which, the L-sub- 
word, contains k unities (compare with Eqs. ((SSI)- 
The product h{i)C'j^'l^ in Eq. gH) is a sharp function 
of i with a maximum at some point i — io whereas pi 
obeys the linear law This implies that pi can be 



factored out of the summation sign being taken at point 
i = iQ. The asymptotical calculation shows that point 
is described by the equation. 



N 



L/2 



1 - 2/i(l - L/N) 



^ 2k 
L 



(45) 



Expression jSJl taken at point iq gives the desired formula 
for pf"' because 



k+N-L 



is obviously equal to WL{k). Thus, we have 



p(")(fc) = i + 



2k 

2 ' N-2fi{N-L) \ 



(46) 



(47) 



Let us consider a very important point relating to 
Eq. 145(1 . If the concentration of unities in the right- 
hand part of the word (|33|l is higher than 1/2, k/L > 1/2, 
then the most probable concentration (zq — k)/{N — L) 
of unities in the left-hand part of this word is likewise 
increased, {io — k)/{N — L) > 1/2. At the same time, the 
concentration {io — k)/{N — L) is less than k/L, 



1 in — k k 
- < — < — . 

2 N - L L 



(48) 



This implies that the increased concentration of unities 
in the i-words is necessarily accompanied by the exis- 
tence of a certain tail with an increased concentration of 
unities as well. Such a phenomenon is referred by us as 
the macro-persistence. An analysis performed in the fol- 
lowing section will indicate that the correlation length Ic 
of this tail is jN with 7 > 1 dependent on the parameter 
H only. It is evident from the above-mentioned property 
of the isotropy of the Markov chain that there are two 
correlation tails from both sides of the L-word. 

Note that the distribution WL{k) is a smooth function 
of arguments k and L near its maximum in the case of 
weak persistence and k, L — k ^ 1. By going over to the 
continuous limit in Eq. (|43|l and using Eq. H47|) with the 
relation p^^\k — 1) = 1 — p^°\k — 1), we obtain the dif- 
fusion Fokker-Planck equation for the correlated Markov 
process. 



dW 1 d'^W d 



where k = k — L /2 and 



2/x 



(1 - 2n)N + 2^1' 



(49) 



(50) 



Equation 1(49(1 has a solution of the Gaussian form 
Eq. I(36() with the variance D{L) satisfying the ordinary 
differential equation. 



dD _ 1 
dL ^ 4 



2ri{L)D. 



(51) 



Its solution, given the boundary condition -D(O) — 0, 
coincides with ((37(1 . 
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4- Self-similarity of the persistent Brownian diffusion 

In this subsection, we point to one of the most in- 
teresting properties of the Markov chain being consid- 
ered, namely, its self-similarity. Let us reduce the A^-step 
Markov sequence by regularly (or randomly) removing 
some symbols and introduce the decimation parameter 
A, 



A = N*/N < 1. 



(52) 



Here N* is a renormalized memory length for the reduced 
iV*-step Markov chain. According to Eq. (|47|l . the con- 
ditional probability of occurring the symbol zero after 
k unities among the preceding N* symbols is described 
by the formula, 



Pk 



2k 



with 



A 



H = 11 



1 - 2/i(l - A) ' 



(53) 



(54) 



The comparison between Eqs. (O and H53() shows that the 
reduced chain possesses the same statistical properties as 
the initial one but it is characterized by the renormalized 
parameters (iV*, ^*) instead of (TV, n). Thus, Eqs. lO 
and H54(l determine the one-parametrical renormalization 
of the parameters of the stochastic process defined by 
Eq. ©. 

The astonishing property of the reduced sequence con- 
sists in that the variance D*{L) is invariant with respect 
to the one-parametric decimation transformation (|52|l . 
(|54|l . In other words, it coincides with the function D{L) 
for the initial Markov chain: 



D*{L)^^[l + m*iL 



1)] = DiL), L < N*. (55) 



Indeed, according to Eqs. (|^ . the renormalized 

parameter m* = 2fi*/[N* - 2n*{N* - 1)] of the re- 
duced sequence coincides exactly with the parameter 
m = 2fi/[N — 2ijl{N — 1)] of the initial Markov chain. 
Since the shape of the function WLik) Eq. is de- 
fined by the invariant parameter n = n* , the distribution 
Wi^ik) is also invariant with respect to the decimation 
transformation. 

The transformation [N , fi) {N* , fj.*) ^ pos- 
sesses the properties of semi-group, i. e., the composition 
of transformations {N, fi) {N* , fi*) and {N*, fi*) 
(iV**, fi**) with transformation parameters Ai and A2 is 
likewise the transformation from the same semi-group, 
{N, fi) {N**, fi**), with parameter A = A1A2. 

The invariance of the function D{L) at i < TV was 
referred to by us as the phenomenon of self-similarity. 
It is demonstrated in Fig. 3 and is accordingly discussed 
below, in Sec. IV A. 



It is interesting to note that the property of self- 
similarity is valid for any strength of the persistency. In- 
deed, the result Eq. (|47|l can be obtained directly from 
Eqs. lfTC|l - lfT7l) . and not only for n > 1 but also for 
the arbitrary value of n. 
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100 



FIG. 3: The dependence of the variance D on the tuple length 
L for the generated sequence with A'' — 100 and fi — 0.4 
(solid line) and for the decimated sequences (the parameter 
of decimation A — 0.5). Squares and circles correspond to 
the stochastic and deterministic reduction, respectively. The 
thin solid line describes the non-correlated Brownian diffu- 
sion, D{L) = L/4. 



C. Long-range diffusion, L > N 

Unfortunately, the very useful proposition 4|k is valid 
for the words of the length L < N only and is not ap- 
plicable to the analysis of the long words with L > N. 
Therefore, investigating the statistical properties of the 
long words represents a rather challenging combinatorial 
problem and requires new physical approaches for its sim- 
plification. Thus, we start this subsection by analyzing 
the correlation properties of the long words {L > N) in 
the Markov chains with iV ^ 1. The two first subsub- 
sections of this subsection mainly deal with the case of 
relatively weak correlations, n ^ 1. 



1. Correlation length at weak persistence 



Let us rewrite Eq. ((SJl in the form, 

\ k=i~N+l / 



(56) 



The angle brackets denote the averaging of the density of 
unities in some region of the Markov chain for its definite 
realization. The averaging is performed over distances 
much greater than unity but far less than the memory 
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length N and correlation length Ic (see Eq. (|60|) below). 
Note that this averaging differs from the statistical av- 
eraging over the ensemble of realizations of the Markov 
chain denoted by the bar in Eqs. ||SJ) and (|5J). Equation 
(|56|l is a relationship between the average densities of uni- 
ties in two different macroscopic regions of the Markov 
chain, namely, in the vicinity of (i -I- l)-th element and 
in the region {i — N, i). Such an approach is similar to 
the mean field approximation in the theory of the phase 
transitions and is asymptotically exact at TV — > cx3. In 
the continuous limit, Eq. H56|l can be rewritten in the 
integral form, 

< a(0 ^ + < ^(f") > d/c - . (57) 

It has the obvious solution, 

< a{i) ~ i >=< a(0) - ^ > cxp {-i/-/N) , (58) 
where the parameter 7 is determined by the relation. 



7 exp 



-1 = 



1 

2^' 



(59) 



A unique solution 7 of the last equation is an increasing 
function of ^ € (0, 1/2). 

Formula (|58|l shows that any fluctuation (the difference 
between < a{i) > and the equilibrium value of 57 = 1/2) 
is exponentially damped at distances of the order of the 
correlation length Ic, 



Ic = iN. 



(60) 



Law (|58|) describes the phenomenon of the persistent 
macroscopic correlations discussed in the previous sub- 
section. This phenomenon is governed by both parame- 
ters, N and ji. According to Eqs. (jHOJ, the corre- 
lation length Ic grows as 7 = 1/45 with an increase in /i 
(at jjL — > 1/2) until the inequality 5 S> l/N is satisfied. 
Here 



5= 1/2 -Ai. 



(61) 



Let us note that the inequality 5 ^ 1/iV defining the 
regime of weak persistence can be rewritten in terms of 
7, 7 < N/A. At (5 « 1/iV, the correlation length 1^ 
achieves its maximum value iV2/4. With the following 
increase of ^, the diffusion goes to the regime of strongly 
correlated diffusion that will be discussed in Subsubsec 3 
of this Subsection. 

At /X — > 0, the macro-persistence is broken and the 
correlation length tends to zero. 



2. Correlation function at weak persistence 

Using the studied correlation properties of the Markov 
sequence and some heuristic reasons, one can obtain the 
correlation function K,{r) being defined as. 



(62) 



and then the variance D{L). Comparing Eq. (|62|l with 
Eqs. 10, l(HJl and taking into account the property of 
sequence, ol = 1/2, it is easy to derive the general rela- 
tionship between functions IC{r) and D(L), 



L-l L-i 



(63) 



Considering (|63|l as an equation with respect to IC{r), 
one can find its solution, 

/C(l) ^ il?(2) - 1, JC{2) = lDi3)-Di2) + l 



= 2 ^^^^ + 1) " 2i:i(r) + D{r - 1)] , r > 3. (64) 

This solution has a very simple form in the continuous 
limit, 



(65) 



Equations (|64|) and 1)41) |l give the correlation function 
at r < TV, n > 1, 



with 



Ci = 1/2, 



3<r<N 



= 1/4, 



and m determined by Eq. 1)411) . In the continuous ap- 
proximation, the correlation function is described by the 
formula. 



m 

T' 



r < N. 



(66) 



The independence of the correlation function of r at r < 
N results from our choice of the conditional probability 
in the simplest form 0. At r > A^, the function IC{r) 
should decrease because of the loss of memory. Therefore, 
using Eqs. H58|) and ()60|l . let us prolongate the correlator 
IC(r) as the exponentially decreasing function at r > A'^, 



TO 



1, 



r < TV, 



4 lexp(-^), r>N. 



(67) 



The lower curve in Fig. ^ presents the plot of the corre- 
lation function at 11 — 0.1. 

According to Eqs. ()65|l . ()67|l . the variance D{L) can be 
written as 



with 



FiL) 



D{L) = ^{l + m.F{L)) 



^ I 2(1 + 7)A^- (1 + 27)^ 




2N± 
L 



1 — cxp ( — 



L<N, 



L> N. 



(68) 



(69) 
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FIG. 4: The dependence of the correlation function K on the 
distance r between the symbols for the sequence with = 20. 
The dots correspond to the generated sequence with = 0.1 
and /i = 50/101. The lower line is analytical result H67|l with 
Ic = 7A'' and 7 = 0.38. 




L 

FIG. 5: The numerical simulation of the dependence D{L) for 
the generated sequence with A'^ = 100 and p = 0.4 (circles). 
The solid line is the plot of function Eq. 16811 with the same 
values of A'' and /i. 



As an illustration of the result Eq. Ht)8|l . we present the 
plot of D{L) for N = 100 and /i = 0.4 by the solid line in 
Fig. [S| The straight line in the figure corresponds to the 
dependence D{L) ~ L/A for the usual Brownian diffusion 
without correlations (for /j, = 0). It is clearly seen that 
the plot of variance (|68|l contains two qualitatively differ- 
ent portions. One of them, ai L N, is the super-linear 
curve that moves away from the line D = L/A with an 
increase of L as a result of the persistence. For L ^ N, 
the curve D(L) achieves the linear asymptotics, 



This phenomenon can be interpreted as a result of the 
diffusion in which every independent step ~ of 
wandering represents a path traversed by a particle dur- 
ing the characteristic "fluctuating time" L ^ (N + Ic). 
Since these steps of wandering are quasi-independent, the 
distribution function WL{k) is the Gaussian. Thus, in the 
case of relatively weak persistence, n ^ 1, WL{k) is the 
Gaussian not only at L < iV (see Eq. (jSHl) but also for 
L>N, I,. 

Note that the above-mentioned property of the self- 
similarity is valid only at the portion L < of the 
curve D(L). Since the decimation procedure leads to 
the decrease of the parameter fj, (see Eq. the plot 

of asymptotics (fTHI for the reduced sequence at L ^ N* 
goes below the D{L) plot for the initial chain. 

3. Statistics of the L-words for the case of strong 
persistence, n <^ In^^ A'^ 

In this subsection, we study the statistical properties 
of long words (L > N) in the sequences of symbols with 
strong correlations. It is convenient to rewrite formula 
for the conditional probability of occurring the sym- 
bol zero after the TV-word containing k unities in the 
form, 

p, = S + 2fi^, (71) 

where v is the number of zeros in the precedent A-word, 
1' = N-k. 

In the case of strong persistence, n <^ liC^ A, the pa- 
rameter 5 = 1/2— ij, is much smaller than 1/A. Therefore, 
the probability can be written as 

(6, = 0, 

P. « <^ i^/N, v^Q, v^N, (72) 

I 1 - (5, v = N. 

It is seen that the probability of occurring the symbol 
zero after the A-word which contains only unities {u = 0) 
represents very small value 5 and it increases significantly 
if ^ 0. This situation differs drastically from the case 
of weak persistency. At n 3> 1, the parameter 5 exceeds 
noticeably the value 1/A, and the probability pi, does 
not actually change with an increase in the number of 
zeros in the preceding A-word. 

The analysis of the symbol generation process in the 
Markov chain in the case of strong persistence gives the 
following picture of the fluctuations. There exist the en- 
tire portions of the chain consisting of the same symbols, 
say unities. The characteristic length of such portions is 
1/6 ^ N. These portions are separated by one or more 
symbols zero. The number of such packets of the same 
symbols in one fluctuation zone is about A. Thus, the 
characteristic correlation distance at which the A-word 
containing the same symbols converts into the A-word 
with I' = A/2 is about N/6, 

A 

Ic^j- (73) 
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The described structure of the fluctuations defines the 
statical properties of the L-words with L > N in the 
case of strong persistence. The distribution function dif- 
fers significantly from the Gaussian and is characterized 
by a concave form at L ^ Ic ^ N/6. As L increases, the 
correlations between different parts of the i-words get 
weaker and the L-word can be considered as consisting 
of a number of independent sub-words. So, according to 
the general mathematical theorems 0, 0|, the distri- 
bution function takes on the usual Gaussian form. Such 
an evolution of the distribution function is depicted in 
Fig. El 
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FIG. 6: The distribution function «)(fc/L) = LWL{k) for N=i 
and 5 — 1/150. Different values of the length L of words is 
shown near the curves. 

The variance D{L) follows the quadratic law D = 
(see Eq. (@21)) up to the range oi L < Ic ^ N/S and 
then approaches to the asymptotics D{L) — BL with 
B - N/4:6 (see Fig.lTJ. 

The upper curve in Fig. ^ presents the correlation 
function for the case of strong persistence {n — 50/101, 
iV = 20). 

IV. RESULTS OF NUMERICAL SIMULATIONS 
AND APPLICATIONS 

In this section, we support the obtained analytical re- 
sults by numerical simulations of the Markov chain with 
the conditional probability Eq. Besides, the proper- 
ties of the studied binary A^-step Markov chain are com- 
pared with those for the natural objects, specifically for 
the coarse-grained written and DNA texts. 

A. Numerical simulations of the Markov chain 

The first stage of the construction of the A^-step 
Markov chain was a generation of the initial non-correla- 
ted A^ symbols, zeros and unities, identically distributed 
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L 

FIG. 7: The dependence of the variance D on the word length 
L for the sequence with A'' = 20 and /j, = 50/101 (solid line). 
The thin solid line describes the non-correlated Brownian dif- 
fusion, D{L) = L/4. 



with equal probabilities 1/2. Each consequent symbol 
was then added to the chain with the conditional prob- 
ability determined by the previous A^ symbols in accor- 
dance with Eq. |(3J). Then we numerically calculated the 
variance D{L) by means of Eq. ||HJ). The circles in Fig. [51 
represent the calculated variance D{L) for the case of 
weak persistence (n = 12.5 ^ 1). A very good agreement 
between the analytical result H68|l and the numerical sim- 
ulation can be observed. The case of strong persistence is 
illustrated by Figs. and [71 where the distribution func- 
tion WL{k) and the variance D{L) are calculated numer- 
ically for n — 4/37 and n = 0.1, respectively. The dots 
on the curves in Fig. ^ represent the calculated results 
for the correlation function IC(r) for n — 0.1 (the upper 
curve) and n = 40 (the lower curve). 

The numerical simulation was also used for the demon- 
stration of the proposition 4|k (Fig. 1) and the self- 
similarity property of the Markov sequence (Fig. 3). The 
squares in Fig. 3 represent the variance D(L) for the 
sequence obtained by the stochastic decimation of the 
initial Markov chain (solid line) where each symbol was 
omitted with the probability 1/2. The circles in this fig- 
ure correspond to the regular reduction of the sequence 
by removing each second symbol. 

And finally, the numerical simulations have allowed us 
to make sure that we are able to determine the parame- 
ters A^ and of a given binary sequence. We generated 
the Markov sequences with different parameters A^ and fi 
and defined numerically the corresponding curves D{L). 
Then we solved the inverse problem of the reconstruc- 
tion of the parameters A^ and /i by analyzing the curves 
D{L). The reconstructed parameters were always in good 
agreement with their prescribed values. In the following 
subsections we apply this ability to the treatment of the 
statistical properties of literary and DNA texts. 
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B. Literary texts 

It is well-known that the statistical properties of the 
coarse-grained texts written in any language exhibit a 
remarkable deviation from random sequences mil. In 
order to check the applicability of the theory of the binary 
A^-step Markov chains to literary texts we resorted to the 
procedure of coarse graining by the random mapping of 
all characters of the text onto the binary set of symbols, 
zeros and unities. The statistical properties of the coarse- 
grained texts depend, but not significantly, on the kind 
of mapping. This is illustrated by the curves in Fig. |S| 
where the variance D{L) for five different kinds of the 
mapping of Bible is presented. In general, the random 
mapping leads to nonequal numbers of unities and zeros, 
ki and fcg, in the coarse-grained sequence. A particular 
analysis indicates that the variance D{L) 1)37(1 gets the 
additional multiplier, 

{ko + kif 

in this biased case. In order to derive the function D{L) 
for the non-biased sequence, we divided the numerically 
calculated value of the variance by this multiplier. 




L 



FIG. 8: The dependence D[L) for the coarse-grained text 
of Bible obtained by means of five different kinds of random 
mapping. 

The study of different written texts has suggested that 
all of them are featured by the pronounced persistent cor- 
relations. It is demonstrated by Fig.|51where the five vari- 
ance curves go significantly higher than the straight line 
D = L/4. However, it should be emphasized that regard- 
less of the kind of mapping the initial portions, L < 80, of 
the curves correspond to a slight anti-pcrsistcnt behavior 
(see insert to Fig. llOf) . Moreover, for some inappropriate 
kinds of mapping (e.g., when all vowels are mapped onto 
the same symbol) the anti-persistent portions can reach 
the values of L ^ 1000. To avoid this problem, all the 
curves in Fig. |3 are obtained for the definite representa- 
tive mapping: (a-m) 0; (n-z) 1. 
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FIG. 9: The dependence D(L) for the coarse-grained texts of 
collection of works on the computer science (m = 2.2 • 10"'^, 
solid line), Bible in Russian (m = 1.9T0~^, dashed line), Bible 
in English (m = 1.5 • 10~^, dotted line), "History of Russians 
in the 20-th Century" by Oleg Platonov (m = 6.4 • 10"*, 
dash-dotted fine), and "Alice's Adventures in Wonderland" 
by Lewis Carroll (m — 2.7 ■ 10~*, dash-dot-dotted fine). 



Thus, the persistence is the common property of the 
binary TV-step Markov chains that have been considered 
in this paper and the coarse-grained written texts at large 
scales. Moreover, the written texts as well as the Markov 
sequences possess the property of the self-similarity. In- 
deed, the curves in Fig. ^| obtained from the text of 
Bible with different levels of the deterministic decima- 
tion demonstrate the self-similarity. Presumably, this 
property is the mathematical reflection of the well-known 
hierarchy in the linguistics: letters — > syllables — s- words 
sentences ~^ paragraphs — *■ chapters —^ books. 

All the above-mentioned circumstances allow us to 
suppose that our theory of the binary iV-step Markov 
chains can be applied to the description of the statistical 
properties of the texts of natural languages. However, in 
contrast to the generated Markov sequence (see Fig. 
where the full length A4 of the chain is far greater than 
the memory length N , the coarse-grained texts described 
by Fig. 1^1 are of relatively short length Ai ^ N. In other 
words, the coarse-grained texts are similar not to the 
Markov chains but rather to some non-stationary short 
fragments. This implies that each of the written texts is 
correlated throughout the whole of its length. Therefore, 
as fae as the written texts are concerned, it is impossible 
to observe the second portion of the curve D{L) parallel 
(in the log-log scale) to the line D{L) — L/4, similar to 
that shown in Fig. [S] As a result, one cannot define the 
values of both parameters N and /i for the coarse-grained 
texts. The analysis of the curves in Fig. 6 can give the 
combination m = 2^/N{l — 2ii) only (see Eq. (|S7|l '). Per- 
haps, this particular combination is the real parameter 
governing the persistent properties of the literary texts. 

We would like to note that the origin of the long-range 
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FIG. 10: The dependence of the variance D on the tuple 
length L for the coarse-grained text of Bible (solid line) 
and for the decimated sequences with different parameters 
A: A = 3/4 (squares), A — 1/2 (stars), and A = 1/256 (trian- 
gles). The insert demonstrates the anti-persistent portion of 
the D{L) plot for Bible. 



correlations in the literary texts is hardly related to the 
grammatical rules as is claimed in Ref. . At short scales 
L < 80 where the grammatical rules are in fact applica- 
ble the character of correlations is anti-persistent (see 
the insert in Fig. Ill() whereas semantic correlations lead 
to the global persistent behavior of the variance D{L) 
throughout the entire of literary text. 

The numerical estimations of the persistent parameter 
m and the characterization of the languages and different 
authors using this parameter can be regarded as a new 
intriguing problem of linguistics. For instance, the un- 
precedented low value of m for the very inventive work 
by Lewis Carroll as well as the closeness of m for the 
texts of English and Russian versions of Bible are of cer- 
tain interest. 

It should be noted that there exist special kinds of 
short-range correlated texts which can be specified by 
both of the parameters, N and /i. For example, all dictio- 
naries consist of the families of words where some prefer- 
able letters are repeated more frequently than in their 
other parts. Yet another example of the shortly corre- 
lated texts is any lexicographically ordered list of words. 
The analysis of written texts of this kind is given below. 



C. Dictionaries 

As an example, we have investigated the statisti- 
cal properties of the coarse-grained alphabetical (lexi- 
cographically ordered) list of the most frequently used 
15462 English words. In contrast to other texts, the sta- 
tistical properties of the coarse-grained dictionaries are 
very sensitive to the kind of mapping. If one uses the 
above-mentioned mapping, (a-m) — s- 0; (n-z) — > 1, the 



behavior of the variance D{L) similar to that shown in 
Fig. 1^1 would be obtained. The particular construction of 
the dictionary manifests itself if the preferable letters in 
the neighboring families of words are mapped onto the 
different symbols. The variance D{L) for the dictionary 
coarse-grained by means of such mapping is shown by 
circles in Fig. ^2 It is clearly seen that the graph of 
the function D{L) consists of two portions similarly to 
the curve in Fig. O obtained for the generated TV-step 
Markov sequence. The fitting of the curve in Fig. Illlbv 
function (|68|l (solid line in Fig. 111!) yielded the values of 
the parameters N = 180 and /i = 0.44. 




L 

FIG. 11: The dependence D{L) for the coarse-grained alpha- 
betical list of 15462 English words (circles). The solid line 
is the plot of function Eq. (55) with the fitting parameters 
N = 180 and /x = 0.44. 



D. DNA texts 

It is known that any DNA text is written by four "char- 
acters", specifically by adenine (A), cytosine (C), gua- 
nine (G), and thymine (T). Therefore, there are three 
nonequivalent types of the DNA text mapping onto one- 
dimensional binary sequences of zeros and unities. The 
first of them is the so-called purine-pyrimidine rule, 
{A,G} ^ 0, {C,T} ^ 1. The second one is the hydrogen- 
bond rule, {A,T} 0, {C,G} 1. And, finally, the third 
is {A,C} ^ 0, {G,T} ^ 1. 

By way of example, the variance D{L) for the coar- 
se-grained text of Bacillus subtilis, complete genome 
(ftp:/ /ftp.ncbi.nih.gov/genomes/bacteria/bacillus^ubti- 
lis/NC_000964.gbk) is displayed in Fig.UHfor all possible 
types of mapping. One can see that the persistent prop- 
erties of DNA are more pronounced than for the writ- 
ten texts and, contrary to the written texts, the D{L) 
dependence for DNA does not exhibit the anti-persistent 
behavior at small values of L. In our view, the noticeable 
deviation of different curves in Fig. 1121 from each other 
demonstrates that the DNA texts are much more com- 



13 




10 1 

10° 10' 10" 10^ 10* 10" 10*" 



L 

FIG. 12: The dependence D{L) for the coarse-grained DNA 
text of Bacillus subtilis, complete genome, for three nonequiv- 
alent kinds of mapping. Sohd, dashed, and dash-dotted 
hnes correspond to the mappings {A,G} 0, {C,T} 1 
(the parameter m = 4.1 ■ 10"^), {A,T} ^ 0, {C,G} ^ 1 
(m = 2.5-10"^), and {A,C} ^ 0, {G,T} ^ 1 (m = 1.5-10"^), 
respectively. 



the X-chromosome of the fruit fly corresponds quaHta- 
tively to Eqs. (EHl, & with /i « 0.35 and N « 250. 
If one draws an analogy between the DNA sequences 
and the Uterary texts, the resemblance of the correlation 
properties of integral literary novels and the DNA texts 
of the Bacteria and Archaea are to be found, whereas 
the DNA texts of the Eukarya are more similar to the 
collections of 10'*-10^ short stories. 




plex objects in comparison with the written ones. Indeed, 
the different kinds of mapping reveal and emphasize var- 
ious types of physical attractive correlations between the 
nucleotides in DNA, such as the strong purine-purine and 
pyrimidine-pyrimidine persistent correlations (the upper 
curve), and the correlations caused by a weaker attrac- 
tion A<->T and C<-^G (the middle curve). 

It is interesting to compare the correlation properties 
of the DNA texts for three different species that belong 
to the major domains of living organisms: the Bacte- 
ria, the Archaea, and the Eukarya J, 9]. Figure [T^ shows 
the variance D{L) for the coarse-grained DNA texts of 
Bacillus subtilis (the Bacteria), Methanosarcina acetivo- 
ransthc (the Archaea), and Drosophila melanogaster - 
fruit fly - (the Eukarya) for the most representative map- 
ping {A,G} 0, {C,T} -> 1. It is seen that the D{L) 
curve for the DNA text of Bacillus subtilis is character- 
ized by the highest persistence. As well as for the written 
texts, the D{L) curves for the DNA of both the Bacteria 
and the Archaea do not contain the linear portions given 
by Eq. (|70|l . This suggests that their DNA chains are 
not stationary sequences. In this connection, we would 
like to point out that their DNA molecules are circular 
and represent the collection of extended coding regions 
interrupted by small non-coding regions. According to 
Figs. ^1^1 the non-coding regions do not disrupts the 
correlation between the coding areas, and the DNA sys- 
tems of the Bacteria and the Archaea are fully correlated 
throughout their entire lengths. Contrary to them, the 
DNA molecules of the Eukarya have the linear structure 
and contain long non-coding portions. As evident from 
Fig. 1131 the DNA sequence of the representative of the 
Eukarya is not entirely correlated. The D(L) curve for 



FIG. 13: The dependence D{L) for the coarse-grained DNA 
texts of Bacillus subtilis, complete genome, the Bacteria, 
(solid line); Methanosarcina acetivorans, complete genome, 
the Archaea, (dashed line); Drosophila melanogaster chromo- 
some X, complete sequence, the Eukarya, (dotted line) for the 
mapping {A,G} 0, {C,T} ^ 1. 



V. CONCLUSION 

Thus, we have developed a new approach to describ- 
ing the strongly correlated one-dimensional systems. The 
simple, exactly solvable model of the uniform binary TV- 
step Markov chain is presented. The memory length N 
and the parameter /i of the persistent correlations are two 
parameters in our theory. The correlation function IC{r) 
is usually employed as the input characteristics for the 
description of the correlated random systems. Yet, the 
function /C(r) describes not only the direct interconnec- 
tion of the elements and a^+r, but also takes into ac- 
count their indirect interaction via other elements. Since 
our approach operates with the "original" parameters N 
and fi, we believe that it allows us to reveal the intrinsic 
properties of the system which provide the correlations 
between the elements. 

We have demonstrated the applicability of the devel- 
oped theoretical model to the different kinds of relatively 
weakly correlated stochastic systems. Perhaps, the case 
of strong persistency is also of interest from the stand- 
point of possible applications. Indeed, the domain struc- 
ture of the symbol fluctuations at n <C 1 is very sim- 
ilar to the domains in magnetics. Thus, an attempt to 
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model the magnetic structures by the Markov chains with 
strongly pronounced persistent properties can be appro- 
priate. 

We would like to note that there exist some features of 
the real correlated systems which cannot be interpreted 
in terms of our two-parametric model. For example, the 
interference of the grammatical anti-persistent and se- 
mantic persistent correlations in the literary texts re- 
quires more than two parameters for their description. 
Obviously, more complex models should be worked out 
for the adequate interpretation of the statistical proper- 
ties of the DNA texts and other real correlated systems. 



In particular, the Markov chains consisting of more than 
two different elements (non-binary chains) can be suit- 
able for modelling the DNA systems. 
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